Towards an Evolutionary Computational Approach to Articulatory Vocal Synthesis with PRAAT
نویسندگان
چکیده
This paper presents our current work into developing an evolutionary computing approach to articulatory speech synthesis. Specifically, we implement genetic algorithms to find optimised parameter combinations for the re-synthesis of a vowel using the articulatory synthesiser PRAAT. Our framework analyses the target sound using Fast Fourier Transform (FFT) to obtain formant information, which is then harnessed in a fitness function applied to a real valued genetic algorithm using a generation size of 75 sounds over 50 generations. In this paper, we present three differently configured genetic algorithms (GAs) and offer a comparison of their suitability for elevating the average fitness of the re-synthesised sounds.
منابع مشابه
Factors Influencing Vocal Pitch in Articulatory Speech Synthesis: A Study Using PRAAT
An extensive study on the parameters influencing the pitch of a standard speaker in articulatory speech synthesis is presented. The speech synthesiser used is the articulatory synthesiser in PRAAT. Categorically, the repercussion of two parameters: Lungs and Cricothyroid on the average pitch of the synthesised sounds is studied. Statistical analysis of synthesis data proclaim the extent to whic...
متن کاملThe history of articulatory synthesis at Haskins laboratories
Articulatory synthesis is a computational technique for synthesizing speech by controlling the shape of the vocal tract over time. Research at Haskins Laboratories on articulatory synthesis began with the arrival of Paul Mermelstein in the early 1970s. While at Bell Laboratories, Mermelstein had developed a vocal tract model that is often referred to as the Mermelstein model [1]. This model is ...
متن کاملAn Analysis-by-Synthesis Approach to Vocal Tract Modeling for Robust Speech Recognition Submitted in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy in Electrical and Computer Engineering
In this thesis we present a novel approach to speech recognition that incorporates knowledge of the speech production process. The major contribution is the development of a speech recognition system that is motivated by the physical generative process of speech, rather than the purely statistical approach that has been the basis for virtually all current recognizers. We follow an analysis-by-s...
متن کاملArticulatory Synthesis Based on Real-Time Magnetic Resonance Imaging Data
This paper presents a methodology for articulatory synthesis of running speech in American English driven by real-time magnetic resonance imaging (rtMRI) mid-sagittal vocal-tract data. At the core of the methodology is a time-domain simulation of the propagation of sound in the vocal tract developed previously by Maeda. The first step of the methodology is the automatic derivation of air-tissue...
متن کاملAcoustic to articulatory inversion
The context of this work is speech analysis. The subject deals with acoustic-to-articulatory inversion, i.e. the recovery of the temporal evolution of the vocal tract shape from the signal. This topic is important because it is likely to give rise to applications in the domains of speech coding as well as second language learning. Acoustic-to-articulatory inversion relies on an analysis by synt...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015